Improving financial data quality using ontologies

نویسندگان

  • Jie Du
  • Lina Zhou
چکیده

a r t i c l e i n f o The performance of financial decision-making directly concerns both businesses and individuals. Data quality is a key factor for decision performance. As the availability of online financial data increases, it also heightens the problem of data quality. In this paper, a taxonomy is created for data quality problems. More importantly, an ontology-based framework is proposed to improve the quality of online financial data. An empirical evaluation of the framework with the financial data of real-world firms provides preliminary evidence for the effectiveness of the framework. The framework is expected to support decision-making in finance and in other domains where data is spread across multiple sources with overlap but complementary in content. Today's widespread financial problems and the economic downturn highlight the importance of financial decision-making to individuals , businesses, and organizations. Intelligence gathering is the first stage of decision making [57], and data quality is a key factor for decision performance [29]. It is reported [63] that 20% of asset managers , investment bankers and hedge fund professionals spend between 25% and 50% of their time in validating data, which prevents them from focusing on tasks that contribute to the bottom line. According to a recent study of the costs and other consequences of dirty or inconsistent data in the secondary mortgage market in the U.S. [22], inaccurate data results in slow and expensive loan processing , weak underwriting, incorrect portfolio management, and other costs to lenders and mortgage investors. Given that financial data including financial statements, market data, and business news are being used increasingly by investors in stock market predictions [16,53], data quality has become an important and widespread issue in financial decision-making. The problems with financial data come in a variety of forms. The main problems of financial data include ambiguity, inconsistency, missing values, inaccuracy, misrepresentation, incompletion, and so on [40]. For instance, missing values are not uncommon in Standard & Poor's Compustat North America dataset. Such problems can directly impact the performance of financial decision-making. Under this backdrop, this study aims to answer the following research question: How should one address the quality problems of financial data so as to improve the performance of financial decision-making? Both qualitative and quantitative approaches have been proposed to address various types of data quality problems [42]. For example, missing values can be replaced with global means or the most …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using Domain Knowledge Provided by Ontologies for Improving Data Quality Management

Several data quality management (DQM) tasks like duplicate detection or consistency checking depend on domain specific knowledge. Many DQM approaches have potential for bringing together domain knowledge and DQM metadata. We provide an approach which uses this knowledge modeled in ontologies instead of aquiring that knowledge by cost-intensive interviews with domain-experts. These ontologies ca...

متن کامل

بررسی هستان شناسی های توسعه یافته مبتنی بر اصول هستان شناسی های منبع باز زیست پزشکی

Background and Aim: Ontologies facilitate data integration, exchange, searching and querying. Open Biomedical Ontologies (OBO) Foundry is a solution for creating reference ontologies. In this foundry, the design of ontologies is based on established principles which allow for their interactions as a single system. The purpose of this study is to determine the main features of ontologies develop...

متن کامل

Towards a Semantic Data Quality Management - Using Ontologies to Assess Master Data Quality in Retailing

A high quality of master data is vital for process automation and IS support in enterprises. Retailers and manufacturers have started improving their master data quality on a syntactical level. Many researchers address problems of syntactical master data quality (e.g. missing values, typo errors), but only few approaches target semantic issues of master data quality management. Poor semantic da...

متن کامل

A Clustering-Based Approach to Ontology Alignment

Ontology alignment is an important problem for the linked data web, as more and more ontologies and ontology instances get published for specific domains such as government and healthcare. A number of (semi-)automated alignment systems have been proposed in recent years. Most combine a set of similarity functions on lexical, semantic and structural features to align ontologies. Although these f...

متن کامل

Using Enterprise Reference Models for Automated ISO 9000 Compliance Evaluation

A computational enterprise model representing key facets of an organization can be an effective tool to consider when planning an enterprise information architecture. For example, a specific organization's quality management business processes and organizational structures can be represented using such a model, and then compared to a reference model of "good" processes and structures, such as t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Decision Support Systems

دوره 54  شماره 

صفحات  -

تاریخ انتشار 2012